BIOTECHNOLOGY '^0 



RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific nnd Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 



PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER; 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION AND PATENTIN SOFTWARE QUESTIONS, PLEASE CONTACT 
MARK SPENCER, TELEPHONE: 703-308-4212; FAX: 703-3084221 
Effective 12/13/03 : TELEPHONE: 571-272-2510; FAX: 571-273-0221 



TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 4.1 PROGRAM , ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 

liUn://www.iisn(o.gov/wcb/ofriccs/pac/clicckcr/chkr41 note, htm 



Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be awnrc ih.il the 
si possibility Lhn( llic disWCD-Rom may have been affected by treatment given to nil incoming mail. 
Please consider using alternate methods of submission for U^tfisk/CD-Rom or replacement disk/CD-Rom. 
Any reply including a sequence listing in electronic fornrdfOnld NOT be sent to t he 2023 1 zip code address for the 
United States Patent and Trademark Office, and instead should be sent via the follow ing to the indicated addresses; 

L EFS-Bio (<litln://ww\v.iisnto.gov/ebc/crs/downlonds/doc»mcnts.htm> , EFS Submission 

User Manual - cPAVE) 

2. U.S. Postal Service: Commissioner for Patents, P.O. Box 1450, Alexandria, VA 2231.1-1450 

3. Hand Carry directly to (EFFECTIVE 12/01/03): 

U.S. Patent "and Trademark Office, Box Sequence, Customer Window, Lobby, Room IB03, Crystal PIh/ji Two, 
201 1 South Clark Place, Arlington, VA 22202 

4. Federal Express, United Parcel Service, oe other delivery service to: U.S. Patent and Trademark Office, 
Box Sequence, Roonv4D03-Mailroom, CiyfiUl Plaza Two, 201 1 South Clark Place, Arlington, VA 22202 




Revised 10/08/03 



Raw Sequence Listing Error Summary 



i: 



ERROR DETECTED SUGGESTED CORRECTION SERIAL NUMBER: ' 

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH u ALri!A" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 

1 Wrapped Nuclcics The number/text at the end of each line "wrapped" down to the next line. This may occur if your file 

Wrapped Aminos was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will 
prevent "wrapping." 

2 Invalid Line Length The rules require that a line not exceed 72 characters in length. This includes white spaces. 



3 Misaligned Amino The numbering under each 5 lh amino acid is misaligned. Do not use tab codes between numbers; 

Numbering use space characters, instead. 

4 Non-ASCII The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please 

ensure your subsequent submission is saved in ASCII text. 

5 Variable Length Sequcnce(s) contain n*s or Xaa's representing more than one residue. Per Sequence Rules, 

each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

6 Patentln 2.0 A "bug" in Patentln version 2.0 has caused. the <220>-<223> section to be missing from amino acid 

"bug" sequences(s) . Normally, Patentln would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 

7 Skipped Sequences Sequence(s) missing. If intentional, please insert the following lines for each skipped sequence: 

(OLD RULES) (2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 

(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 

(xi) SEQUENCE DESCRIPTION:SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences. 

8 Skipped Sequences Sequence(s) missing. If intentional, please insert the following lines for each skipped sequence. 

(NEW RULES) <2I0> sequence id number 
<400> sequence id number 
000 



Use of n's or Xaa'; 
.'(NEW RULES) 



Jnvalid <213> 
Response 



Uscof<220> 



12 



_PatentIn2.0 
""bug- 



Use of n's and/orXaa's have been detected in the Sequence Listing. 

Per 1 .823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

Per 1 .823 of Sequence Rules, the only valid <2 1 3> responses arc: Unknown, Artificial Sequence, or 
scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or 
is Artificial Sequence 

Sequcnce(s) missing the <220> "Feature" and associated numeric identifiers and responses. 

Use of <220> to <223> is MANDATORY if <2I3> "Organism" response is "Artificial Sequence" nr 

"Unknown." Please explain source of genetic material in <220> to <223> section. 

(See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 



13 



Misuse of n/Xaa "n" can only represent a single nucleotide ; "Xaa" can only represent a single amino acid 



AMC - Biotechnology Systems Branch - 09/09/2003 
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IFW16 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/031 , 4 96B 



DATE: 03/12/2004 
TIME: 14:56:14 



3 
5 
7 
9 
10 
12 
14 
16 
17 
18 
19. 
21 
22 
25 
26 
27 
28 
30 
31 
34 
35 
36 
37 
39 
4 0 
4 3 
44 
45 
46 
49 
50 
51 
53 
54 
55 
57 
58 
59 
61 
62 
63 
65 
66 



<110> 
<120> 
<130> 
<14'0> 
<141> 
<160> 
<17 0> 
<210> 
<211> 
<212> 
<213> 
<400> 



Input Set : A:\NREL 99-45.ST25.txt 
Output Set: N:\CRF4\03122004\J031496B.raw 

APPLICANT: National Renewable Energy Laboratory 

TITLE OF INVENTION: Cellobiohydrols.se I G^ne and Improved Variants 

FILE REFERENCE: NREL 99-4 5 

CURRENT APPLICATION NUMBER: 10/031, 496B 

CURRENT FILING DATE: 2002-01-14 

NUMBER OF SEQ ID NOS : 120 

SOFTWARE: Patentln version 3.2 

SEQ 10 N0: \^JUI^\V^7^ 



LENGTH: 2 8 
TYPE: DNA 



OKGANIS* 
SEQUENCE: 




agagagteta gacacgqagc ttacaggc 
<210> 
<211> 
<212> 
<213> 
<400> 



SEQ ID NO: 2 

LENGTH: 35 

TYPE: UNA"' "" f 
ORGANISM: Synthetic 
S EQUENCET^ — 



28 



DNA 



aaagaagcgc ggccgcgcct 
<210> SEQ ID NO: 3 
LENGTH: 24, 
TYPE: TMfC 



gcactctcca aLcgg 



35 



<211> 

<212> , 

<213> ORGANISM: 'Synthetic DNA 

<4 00> SEQUENCET"^ " ' 

ggcggaaacc cgcctggcac cacc 



<210> 


SFQ ID NO 


4 


<21 1> 


LENGTH : 1550 


<212> 


TYPE: DNA 




<213> 


ORGANISM : 


Trichoderma roesei 


<220> 


FEATURE : 




<221> 


NAME /KEY : 


misc signal 


<222> 


LOCATION: 


!D . . (51) 


<220> 


FEATURE : 




<221> 


NAME /KEY: 


CDS 


<222> 


LOCATION: 


(3) . . (1550) 


<220> 


FEATURE: 




<221> 


NAME /KEY: 


misc feature 


<222> 


LOCATION: 


(52)". . (1.344) 


<220> 


FEATURE: 




<221> 


NAME /KEY : 


misc feature 


<222> 


LOCATION : 


(1345) . . (1435) 


<220> 


FEATURE: 




<221> 


NAME /KEY : 


misc ^binding 



24 



file://C:\CRF4\Outhold\VsrJ031496B.htm 
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RAW SEQUENCE LISTING DATE: 03/12/2004 

PATENT APPLICATION: US/10/031 , 496B TIMF 1 ,: 14:56:14 

Input Set : A:\NREL 99-45.ST25.txt 
Output Set: N : \CRF4\O31220O4\ J031496B, raw 



67 <?.22> LOCATION : 


(1436) . . 


(1550) 




















69 <400> SEQUENCE: 


4 


























7 0 at gta teg gaa gtt ggc cgt c 


at etc gqc ctt ctt ggc cac age teg 


4 7 


71 Val Ser G 


lu Val G 


."ly Arg His Leu Gly Leu Leu Gly .His Ser Ser 




7 2 1 








5 










10 








15 




7 4 tgc 


tea 


gt c 


acre 


etq 


cac 


tct 


cca 


at c 


gga 


gac 


tea 


ccc 


gec 


tct 


gac 


95 


75 Cys 


Ser 


Val 


Gly 


Leu 


His 


Ser 


Pro 


He 


Gly 


Asp 


Ser 


Pro 


Ala 


Ser 


Asp 




76 








20 










25 










30 






7 8 atg 


gea 


gaa 


atg 


etc 


gt c 


t aa 

^ rJ J 


tea 
v j j 


cac 


gtg 


cac 


tea 


aca 


gac 


agg 


etc 


143 


7 9 Met 


Ala 


Glu 


Met 


Lei: 


Val 


Trp 


Trp 


His 


Val 


His 


Ser 


Thr 


Asp 


Arg 


lieu 




80 






35 










40 










45 








82 cgt 


ggt 


cat 


cga 


egc 


caa 


eta 


aca 


ctg 


qac 

v^. ^ 


tea 


egc 


tac 


gaa 


cag 


cag 


191 


83 Arg Gly 


His 


Arg 


Arg 


Gin 


Leu 


Ala 


Leu 


Asp 


Ser Arg 


Tyr 


Glu 


Gin 


Gin 




8 4 




50 










55 










60 










8 6 c a p. 


gaa 


rt a 


eta 


ega 


t aa 


caa 


cac 


ttg 


aaa 


etc 


gac 


cct 


atg 


tec 


tga 


239 


87 His 


Glu 


Leu 


Leu 


Arg 


Trp 


Gin 


His 


Leu 


Glu 


Leu Asp 


Pro 


Met 


Ser 






o o 


6S 










70 










75 












_? U U d ci 


c ga 


gac 


ctg 


egc 


gaa 


gaa 


eta 


ctg 


tct 


gga 


egg 


tac 


cac 


eta 


cac 


287 


91 Gin 


Arg 


Asp 


Leu 


Arg 


Glu 


Glu 


Leu 


Leu 


Ser 


Gly Arg 


Cys 


Arg 


Leu 


Ara 




Q9 


R 0 










8 5 










90 












Q /l rr t* n 

_7 'i tj I. v.. 




gta 


ran 


a at 


tac 


ca c 


aa a 


caa 


t aa 


cag 


cct 


etc 


eat 


taa 


ctt 


335 


95 Val 


His 


Val 


Arg 


Se.r 


Tvr 


His 


Glu 


Arg 




Gin 


Pro 


Leu 


Pis 


Trp 


Leu 




96 95 










100 












105 












98 tgt 


cac 


cca 


ate 


t ac 


gea 


gaa 


aaa 


cgt 


tgg 


egc 


tog 


cct 


tta 


cct 


tat 


383 


99 Cys 


His 


Pro 


Val 




Ala 


Glu 


Glu 


Arg 


Trp 


Arg Ser 


Pro 


Leu 


Pro 


Tvr 




1.0 0 110 










115 










120 










125 




102 ggc 


gag 


cga 


cac 


ga c 


eta 


cca 


aaa 


att 


cac 


act 


get 


taa 


caa 


caa 


gtt 


431 


103 Gly 


Glu 


Arg 


His 


Asp 


Leu 


Pro 


Gly 


lie 


His' 


Pro 


Ala 


Trp 


Gin 


Arg 


Val 




104 








130 










135 










14 0 






106 etc 


ttt 


cga 


tnt 


t.ga 


tat 


tte 


gea 


get 


gec 


gtg 


egg 


ctt 


gaa 


egg 


age 


479 


107 Leu 


Phe 


Arq 


Cvs 




Cys 


Phe 


Ala 


Ala 


Ala 


Val 


Arg 


Leu 


Glu 


Arg 


Ser 




108 






145 












150 










15 5 






110 tct 


eta 


ctt 


cgt 


gtc 


cat 


gga 


egc 


gga 


tgg 


tgg 


cgt 


gag 


caa 


gta 


tec 


527 


111 Ser 


Leu 


Leu 


Arg 


Val 


' His 


Gly 


Arg 


Gly 


Trp 


Trp 


Arg 


Glu 


Gin 


Val 


Ser 




112 






160 










165 










170 








114 cac 


caa 


cac 


egc 


tgg 


egc 


caa 


gta 


egg 


cac 


ggg 


gta 


ctg 


tga 


cag 


cca 


575 


lib His 


Gin 


His 


Arg 


Trp 


Arg 


Gin 


Val 


Arg 


His 


Gly 


Val 


Leu 




Gin 


Pro 




116 




175 










180 










18 5 










118 gtg 


tec 


ccg 


cga 


tct 


gaa 


gtt 


cat 


caa 


tgg 


cca 


ggc 


caa 


cgt 


tga 


ggg 


623 


11.9 Val 


Ser 


Pro 


Arg 


Ser 


Glu 


Val 


His 


G J. n 


Trp 


Pro 


Gly 


Gin 


Arg 




Gly 




120 




190 










195 










20 0 










122 ctg 


gga 


gee 


gtc 


ate 


caa 


caa 


egc 


gaa 


cac 


ggg 


cat 


tgg 


agg 


aca 


egg 


671 


123 Leu 


Gly 


Ala 


Val 


He 


Gin 


Gin 


Arg 


Glu 


His 


Gly 


His 


Trp 


Arg 


Thr 


Arg 




124 




205 










210 










215 










126 aag 


ctg 


ctg 


etc 


tga 


gat 


gga 


tat 


ctg 


gga 


ggc 


caa 


etc 


eat 


etc 


ega 


719 


12 7 I,ys 


Leu 


Leu 


Leu 




Asp 


Gly 


Tyr 


Leu 


Gly 


Gly 


Gin 


Leu 


His 


Leu 


Arg 




128 


220 












225 










230 










130 ggc 


tct 


tac 


ccc 


cca 


ccc 


ttg 


cac 


gac 


tgt 


egc 


cca 


gga 


gat 


ctg 


cga 


7 67 


131 Gly 


Ser 


Tyr 


Pro 


Pro 


Pro 


Leu 


His 


Asp 


Cys 


Arg 


Pro 


Gly 


Asp 


Leu 


Arg 
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RAW SEQUENCE LISTING DATE; 03/12/2004 

PATENT APPLICATION: US/10/031 , 4 96B TIME: 14:56:14 

Input Set : A:\NREL 99-45.ST25.txt 
Output Set: N:\CRF4\03122004\J031496B.raw 



132 




235 










240 










24 5 












1 34 


ggg 


tga 


tgg 


gtg 


egg 


egg 


aac 


tta 


etc 


cga 


taa 


cag 


ata 


tgg 


egg 


cac 


81.5 


135 


Gly 




Trp 


Val 


Arg Arg 


Asn 


Leu 


Lou 


Arg 




Gin 


He 


Trp 


Arg 


His 




136 


250 












255 












260 










138 


ttg 


eg a 


tec 


cga 


tgg 


ctg 


cga 


ctg 


gaa 


ccc 


at a 


ccg 


cct 


ggg 


caa 


cac 


8 63 


139 


Leu 


Arg 


Ser 


Arg 


Trp 


Leu 


Arg 


Leu 


G3u 


Pro 


lie 


Pro 


Pro 


Gly 


Gin 


His 




140 




265 










270 










27 5 












142 


caq 


ctt 


eta 


egg 


ccc 


tea 


etc 


aag 


ctt 


tac 


cct 


cga 


tac 


cac 


caa 


gaa 


911 


.143 


Gin 


Lou 


Leu 


hrg 


Pro 


Tro 


Leu 


Lys 


Leu 


Tyr 


Pro 


Arg 


Tyr 


His 


Gin 


Glu 




14 4 


280 










285 










290 










295 




146 


att 


gac 


eg t 


tgt 


cac 


cca 


gtt 


cga 


gac 


gtc 


ggg 


tgc 


cat 


caa 


ccg 


ata 


959 


147 


lie 


Asp 


Arq 


Cys 


His 


Pro 


Val 


Arg 


Asp 


Va .1 


Gly 


Cys 


His 


Gin 


Pro 


He 




148 










300 










305 










310 






150 


eta 


tqt 


cca 


gaa 


tgg 


cqt 


cac 


ttt 


cca 


gca 


gec 


caa 


cgc 


eg a 


get 


tgg 


1007 


151 


Leu 


Cys 


Pro 


Glu 


Trp 


Arg 


His 


Phe 


Pro 


Ala 


Ala 


Gin 


Arg 


Arg 


Ala 


Trp 




152 








315 










320 










325 








154 


taq 


tta 


etc 


tgg 


caa 


cga 


get 


caa 


cga 


tga 


tta 


ctg 


cac 


age 


tga 


gga 


1055 


155 




Leu 


Leu 


Trp 


Gin 


Arg 


Ala 


Gin 


Arg 




Leu 


Leu 


His 


Ser 




Gly 




156 








330 










335 














340 




158 




aaa 


att 


egg 


egg 


ate 


etc 


ttt 


etc 


aga 


caa 


ggg 


egg 


cct 


gac 


tea 


1103 


159 


Gl v 


Arg 


lie 


Arg Arg 


lie 


Leu 


Phe 


Leu 


Arg 


Gin 


Gly 


Arg 


Pro 


Asp 


Ser 




160 










345 










350 










35 5 






162 


att 


caa 


ga a 


ggc 


tac 


etc 


tqq 


egg 


cat 


qqt 


tct 


cqt 


eat 


gag 


tct 


gtg 


1151 


163 


Val 


Gin 


Glu 


Gly 


Tyr 


Leu 


Trp 


Arg 


His 


Gly 


Ser 


Gly 


His 


Glu 


Ser 


Val 




: 64 








360 










3 65 










37 0 








166 


qqa 


tga 


tta 


eta 


cgc 


caa 


eat 


get 


gtg 


get 


gg a 


etc 


cac 


eta 


ccc 


gac 


1199 


167 


Gly 




Leu 


Leu Arg 


Gin 


His 


Ala 


Val 


Ala 


Gly 


Leu 


His 


Leu 


Pro 


Asp 




168 








37 5 










380 










385 








170 


aaa 


eqa 


qac 


etc 


etc 


cac 


ace 


egg 


tge 


cgt 


gcg 


egg 


aag 


ctg 


etc 


cac 


1247 


1 71 


liVS 


Arq 


Asp 


Leu 


Leu 


His 


Thr 


Arg 


Cys 


Arg 


Ala 


Arg 


Lys 


Leu 


Lea 


His 




172 






390 










395 










400 










174 


cag 


etc 


egg 


tgt 


ccc 


tqe 


tea 


ggt 


cga 


ate 


tea 


gtc 


tec 


caa 


cgc 


caa 


12 95 


17 b 


Gin 


Leu 


Arg 


Cys 


Pro 


Cvs 


Ser 


Gly 


Arg 


lie 


Ser 


Val 


Ser 


Gin 


Arg 


Gin 




176 




4 05 










410 










415 












1 7 8 


qqt. 


cac 


ctt 


etc 


caa 


cat 


caa 


gtt 


egg 


aec 


cat 




cag 


cac 


egg 


caa 


1343 


179 


Gly 


His 


Leu 


lieu 


Gin 


His 


Gin 


Val 


Arg 


Thr 


His 


Trp 


Gin 


His 


Arg 


Gin 




180 


420 










425 










4 30 










4 35 




3 82 


ccc 


tag 


egg 


egg 


caa 


ccc 


tec 


egg 


egg 


aaa 


ccc 


gec 


tgg 


cac 


cac 


cac 


1391 


183 


Pro 




Arg 


Arg 


Gin 


Pro 


Ser 


Arg 


Arg 


Lys 


Pro 


Ala 


Trp 


His 


His 


His 




184 












440 










445 










450 




18 6 


cac 


ccg 


ccg 


cce 


age 


cac 


tac 


cac 


tgg 


aag 


etc 


tec 


egg 


ace 


tac 


cca 


1439 


1.87 


His 


Pro 


Pro 


Pro 


Ser 


His 


Tyr 


His 


Trp 


Lys 


Leu 


Ser 


Arg 


Thr 


Tyr 


Pro 




188 










455 










460 










465 






190 


gtc 


tea 


eta 


egg 


cca 


gtg 


egg 


egg 


tat 


tgg 


eta 


cag 


egg 


cce 


cac 


ggt 


1487 


191 


Val 


Ser 


Leu 


Arg 


Pro 


Val 


Arg Arg 


Tyr 


Trp 


Leu 


Gin 


Arg 


Pro 


His 


Giy 




192 








470 










475 










480 








194 


ctg 


ego 


cag 


egg 


cac 


aac 


ttg 


cca 


ggt 


cor 


gmc 


cct 


tac 


tac 


tct 


cag 


1535 


> 195 


Leu 


Arg 


Gin 


Arg 


His 


Asn 


Leu 


Pro 


Gly 


Pro 


Xaa 


Pro 


Tyr 


Tyr 


Ser 


Gin 




196 






485 










4 90 










495 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/031 , 496B 



DATE: 03/12/2004 
TIME : 14:56:14 



Input Set : A:\NREL 99-45.ST25.txt 
Output Set: N:\CRF4\03122004\J031496B.raw 



190 tgc ctq taa age tec 

199 Cys Leu Ser Ser 

200 ^ 500 



1550 



203 <210> SEQ ID NO: 5 

204 <211> LENGTH: 78 

205 <212> TYPE: PRT 

206 <213> ORGANISM: Trichoderma reesei 
208 <400> SEQUENCE: 5 

210 Val Ser Glu Val Gly Arq His Leu Gly Lgu Leu Gly His Ser Ser Cys 

211 ]. 5 10 15 . 

214 Ser Val Gly Leu His Ser Pro lie Gly Asp Ser Pro Ala Ser Asp Mot 

215 ' 20 25' 30 

218 Ala Giu Met Leu Val Trp Trp His Val His Ser Thr Asp Arg Leu Arg 

219 35 ^ 40 45 

222 Gly His Arg Arg Gin Leu Ala Leu Asp Ser Arg Tyr Glu Gin Gin His 

223 50 . 55 60 

226 Glu Leu Leu Arg Trp Gin His Leu Glu Leu Asp Pro Met Ser 

227 65 ' 70 75~ 

230 <210> SEQ ID NO: 6 

231 <211> LENGTH: 25 

232 <212> TYPE: PRT 

233 <213> ORGANISM: Trichoderma reesei 
235 <400> SEQUENCE: 6 

237 Gin Arg Asp Leu Arg Glu Glu Leu Leu Ser Gly Arg Cys Arg Leu Arg 

238 1 5 10 15 

241 Val His Val Arq Ser Tyr His Glu Arg 

242 20 25 

245 <210> SEQ ID NO: 7 

246 <211> LENGTH: 42 

247 <212> TYPE: PRT 

248 <213> ORGANISM: Trichoderma reesei 
250 <40C> SEQUENCE: 7 

252 Gin Pro Leu His Trp Leu Cys His Pro Val Cys Ala Glu Glu Arg Trp 

253 1 5 10 ■ 15 

256 Arg Ser Pro Leu Pro Tyr Gly Glu Arg His Asp Leu Pro Gly lie His 

257 ' 20 ~ ' 25 ' 30 
2 60 Pro Ala Trp Glu Arg Val Leu Phe Arg Cys 

261 35 40 

264 <210> SEQ ID NO: 8 

265 <211> LENGTH: 40 
2 66 <212> TYPE: PRT 

267 <23 3> ORGANISM: Trichoderma reesei 
269 <400> SEQUENCE: 8 

271 Cys Phe Ala A.l a Ala Val Arg Leu Gin Arg Ser Ser Leu Leu Arg Val 

272 1 5 10 15 

275 His Gly Arg Gly Trp Trp Arg G.'lu Gin Val Ser His Gin His Arg Trp 

276 * 20 25 30 
27 9 Arg Gin Val Arg His Gly Val Leu 

280 35 40 
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RAW SEQUENCE LISTING DATE: 03/12/2004 

PATENT APPLICATION: US/10/031 , 496B TIME: 14:56:14 

Input Set : A:\NREL 99-45.ST25.txt 
Output Set: N:\CRF4\03122004\J031496B.raw 

283 <210> SEQ ID NO: 9 

284 <211> LENGTH: 16 
28b <212> TYPE: PKT 

286 <213> ORGANISM: Trichodcrma reesei 
288 <400> SEQUENCE: 9 

290 Gin Pro Val Ser Pro Arg Ser Glu Val His Gin Trp Pro Gly GJ n Arg 

291 1 5 10 15 

294 <21U> SEQ ID NO: 10 

295 <211> LENGTH: 21 

296 <212> TYPE: PRT 

2 97 <213> ORGANISM: Trichodermn reesei 
2 99 <4 0 0> SEQUENCE : 10 

301 Gly Leu Gly Ala Val He Gin Gin Arg Glu His Gly Hrs Trp Arg Thr 

302 1 5 10 15 
30 5 Arg Lys Leu Leu Leu 

306 20 

309 <210> SEQ ID NO: 11 

310 <211> LENGTH: 28 

311 <212> TYPE: PRT 

312 <213> ORGANISM: Trichoderma reesei 
314 <4 00> SEQUENCE: 11 

316 Asp Gly Tyr Leu Gly Gly Gin Lou His Leu Arg Gly Ser Tyr Pro Pro 

317 1 ^ 5 10 15 
32 0 Pro Leu His Asp Cys Arg Pro Gly Asp Leu Arg Gly 

321 20 25 

324 <210> SEQ ID NO: 12 
32 5 <211> LENGTH: 8 
32 6 <212> TYPE: PRT 

327 <213> ORGANISM: Trichoderma reesei 

32 9 <4 00> SEQUENCE: 12 

331 Trp Val Arg Arg Asn Leu Leu Arg 

332 1 5 
335 <210> SEQ ID NO: 13 

33 6 <211> LENGTH: 69 

337 <212> TYPE: PRT 

338 <213> ORGANISM: Trichoderma reesei 
340 <400> SEQUENCE: 13 



34 2 


Gin 


He 


Trp 


Arg 


His 


Leu 


Arg 


Ser 


Arg 


Trp Leu 


Arg 


Leu 


Glu 


Pro 


He 


343 


1 








5 










10 








15 




346 


Pro 


Pro 


Gly 


Gin 


nis 


Gin 


Leu 


Leu 


Arg 


Pro Trp 


Leu 


Lys 


Leu 


Tyr 


Pro 


347 






20 










25 








30 






35 0 


Arg 


Tyr 


His 


Gin 


Glu 


He 


Asp 


Arg 


Cys 


His Pro 


Val 


Arg 


Asp 


Val 


Gly 


351 




35 










40 








45 








354 


Cys 


His 


Gin 


Pro 


He 


Leu 


Cys 


Pro 


Glu 


Trp Arg 


His 


Phe 


Pro 


Ala 


Ala 


355 




50 










55 








60 










35 3 


Gin 


Arg 


Arg 


Ala 


Trp 























35 9 65 

362 <210> SEQ ID NO: 14 

363 <211> LENGTH: 8 
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RAW SEQUENCE LISTING ERROR SUMMARY DATE; 03/12/2 004 

PATENT APPLICATION: US/10/031 , 4 96B TIME: 14:56:15 

Input Set. : A:\NREL 99-4 5.ST25.txt 
Output. Set: N:\CRF4\03122004\J031496B.raw 

Please Note: 

Use of n and/ or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
to <223> fields of each sequence which presents at least one n or Xaa. 

Seq#:4; Xaa Pos . 4 93 
Seq#:lH; Xaa Pos. 57 
Seq#:32; Xaa Pos. 57 
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VERIFICATION SUMMARY DATE: 03/12/2004 

PATENT APPLICATION: US/10/031 , 496B TIME: 14:56:15 

Input Set : A:\NREL 99-45.ST25.txt 
Output Set: N:\CRF4\031220O4\J031496B.raw 

L:195 M:258 W: Mandatory Feature missing, <223> Tag not found for SF.Q ID#:4 

L:195 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:4 after pos.:1535 

L:45C M:341 VJ: (46) M n" or "Xaa" used, for SEQ ID#:18 after pos.:48 

L:702 M:341 W: (46) "n" or "Xan" used, for SEQ ID#:32 after pos.:48 



file://C:\CRF4\Outhold\VsrJ03 1 496B.htm 



3/12/04 



